Supporting Realistic OpenMP Applications on a Commodity Cluster of Workstations
نویسندگان
چکیده
This paper describes compiler techniques that can translate standard OpenMP applications into code for distributed computer systems and optimization techniques to enhance the performance of realistic OpenMP application. OpenMP has emerged as an important model and language extension for shared-memory parallel programming. However, despite OpenMP's success on these platforms, it is not currently being used on distributed system. The goal of our project is to quantify the degree to which such a use is possible and develop supporting compiler techniques. Our present compiler techniques translate OpenMP programs into a form suitable for execution on a Software DSM system. We have implemented a compiler that performs this basic translation, and we have studied optimization techniques that improve the baseline performance of OpenMP applications on distributed computer systems. Our results show that, while kernel benchmarks can show high eÆciency of OpenMP programs on distributed systems, full applications need careful consideration of shared data access patterns. A naive translation (similar to OpenMP compilers for SMPs) leads to acceptable performance in very few applications only. However, additional optimizations, including computation repartitioning and access privatization result in average 300% (or from 20% to x16) speedup improvement on our SPEC OMPM2001 benchmarks.
منابع مشابه
Parallel Implementation of Exact Two Dimensional Pattern Matching Algorithms using MPI and OpenMP
The need for processing power is constantly increasing as more processing-demanding and time-critical applications appear. Parallel processing has emerged as an efficient and costeffective solution, since low-end workstations and multiprocessors are cheaply available in the commodity market. In this paper, experimental results are presented on the parallel processing of the Naive, Karp and Rabi...
متن کاملXML Opportunities in Real Time Immersive Simulation & Visualization Based on Clusters of Commodity Computers
Real Time Immersive Simulation and Visualization applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers (PCs, Macintoshes, low cost workstations) have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, performance scalability and use of to legacy systems. Th...
متن کاملManaging Commodity Computer Cluster Oriented to Virtual Reality Applications
Virtual Reality (VR) applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers (PCs, Macintoshes, low cost workstations) have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, access to technology, and performance scalability. The main goal of our proposal is ...
متن کاملPerformance Analysis of PC-CLUMP based on SMP-Bus Utilization
PC-CLUMP (Cluster of Multiprocessor) is one of the most cost-e ective commodity-based platforms for HPC applications. The increasing number of CPUs per SMP node realizes very compact system size and very low price on the network interface per processor keeping the number of CPUs in the system. However, the performance of SMP-bus on such an SMPPC node is relatively poor compared with that of SMP...
متن کاملManagement of Commodity Computer Cluster Oriented to Virtual Reality Applications
Virtual Reality (VR) applications have been powered traditionally by high-end graphics workstations or supercomputers. But recently, clusters of commodity computers have become a practical alternative. The advantages of a commodity cluster include low cost, flexibility, access to technology, and performance scalability. The main goal of our proposal is to offer both a High Performance Computing...
متن کامل